Off-Policy Q-Learning: Set-Point Design for Optimizing Dual-Rate Rougher Flotation Operational Processes

نویسندگان

چکیده

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A two level hierarchical control structure for optimizing a rougher flotation circuit

The control of rougher flotation circuits represents a challenging control problem due to the non linearities, multiple inputs-multiple outputs and the wide variety of disturbances acting on the system. Many concentrators rely on regulatory control loops to maintain a stable operation and on the plant operators to find the best operational results. As a mean of using the operator’s knowledge in...

متن کامل

Optimizing Dissolved Air Flotation Design System

Dissolved Air (Pressure) Flotation-DAF, is a well-established separation process that employs micro-bubbles as a carrier phase. This work shows results concerning bubble generation at low working pressures in modified DAF-units to improve the collection of fragile coagula by bubbles. DAF of Fe (OH)3 (as model) was studied as a function of saturation pressure in the absence and presence of surfa...

متن کامل

Off-policy reinforcement learning for H∞ control design

The H∞ control design problem is considered for nonlinear systems with unknown internal system model. It is known that the nonlinear H∞ control problem can be transformed into solving the so-called Hamilton-Jacobi-Isaacs (HJI) equation, which is a nonlinear partial differential equation that is generally impossible to be solved analytically. Even worse, model-based approaches cannot be used for...

متن کامل

Q($\lambda$) with Off-Policy Corrections

We propose and analyze an alternate approach to off-policy multi-step temporal difference learning, in which off-policy returns are corrected with the current Q-function in terms of rewards, rather than with the target policy in terms of transition probabilities. We prove that such approximate corrections are sufficient for off-policy convergence both in policy evaluation and control, provided ...

متن کامل

Variational Policy for Guiding Point Processes

Temporal point processes have been widely applied to model event sequence data generated by online users. In this paper, we consider the problem of how to design the optimal control policy for point processes, such that the stochastic system driven by the point process is steered to a target state. In particular, we exploit the key insight to view the stochastic optimal control problem from the...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Industrial Electronics

سال: 2018

ISSN: 0278-0046,1557-9948

DOI: 10.1109/tie.2017.2760245